feat: moderation v2 core backend engine and pipeline by ArthurzKV · Pull Request #333 · openclaw/clawhub

ArthurzKV · 2026-02-15T20:48:24Z

Summary

Introduce moderation v2 backend foundation in ClawHub: normalized verdict/reason/evidence model, deterministic static scanning, publish-time moderation derivation, and backfill support.

Why

Trust decisions need to be consistent and explainable across static, VT, and LLM signals while preserving compatibility with existing moderation fields.

Focused scope

This PR is scoped to one theme: core moderation v2 backend pipeline.

What changed

Added normalized moderation fields in convex/schema.ts.
Added canonical reason code + verdict utilities in convex/lib/moderationReasonCodes.ts.
Added moderation engine in convex/lib/moderationEngine.ts.
Integrated deterministic static scan in publish/backfill paths (convex/lib/skillPublish.ts, convex/skills.ts, convex/vt.ts).
Updated moderation/public safety logic (convex/lib/moderation.ts, convex/lib/public.ts, convex/lib/skillSafety.ts).
Follow-up fixes included:
- escalateByVtInternal moderation flag overwrite bug
- backfill cursor skip edge case
- child_process false-positive fallback in scanner
- rule name alignment to suspicious.nonstandard_network

Local validation

bun run lint:oxlint
bunx vitest run convex/lib/moderationEngine.test.ts convex/skills.rateLimit.test.ts

AI assistance transparency

AI-assisted: Yes (implemented with Codex assistance)
Testing level: Targeted local validation on touched modules
I reviewed the final diffs and understand the behavior changes.

vercel · 2026-02-15T20:48:27Z

@ArthurzKV is attempting to deploy a commit to the Amantus Machina Team on Vercel.

A member of the Team first needs to authorize it.

greptile-apps

_{10 files reviewed, 3 comments}

_{Edit Code Review Agent Settings | Greptile}

convex/skills.ts

convex/lib/moderationEngine.ts

ArthurzKV · 2026-02-15T21:04:25Z

Addressed review feedback in follow-up commits:

ac8fde6:
- fixed escalateByVtInternal so moderationFlags are not overwritten after merge logic.
- fixed backfill cursor advancement to avoid skipping a candidate at batch boundaries.
- fixed child_process exec guard so fallback line text does not create false positives.
beba7a0: renamed network reason code to suspicious.nonstandard_network for naming consistency.

Validation run: lint + moderation engine/rate-limit tests.

…urzKV

…urzKV)

steipete · 2026-03-08T03:13:45Z

Landed via temp rebase onto main.

Gate: bun run lint && bun run build && bun run test
Land commit: 363c395
Merge commit: e31a8e9

Thanks @ArthurzKV!

ArthurzKV mentioned this pull request Feb 15, 2026

feat: moderation v2 trust verification pipeline #332

Closed

greptile-apps bot reviewed Feb 15, 2026

View reviewed changes

convex/skills.ts Outdated Show resolved Hide resolved

convex/skills.ts Outdated Show resolved Hide resolved

convex/lib/moderationEngine.ts Outdated Show resolved Hide resolved

fix: add structured moderation snapshots (openclaw#333) (thanks @Arth…

363c395

…urzKV)

steipete force-pushed the codex/skill-verification-v2-clawhub-core branch from beba7a0 to 363c395 Compare March 8, 2026 03:12

steipete merged commit e31a8e9 into openclaw:main Mar 8, 2026
2 checks passed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

feat: moderation v2 core backend engine and pipeline#333

feat: moderation v2 core backend engine and pipeline#333
steipete merged 1 commit intoopenclaw:mainfrom
ArthurzKV:codex/skill-verification-v2-clawhub-core

ArthurzKV commented Feb 15, 2026 •

edited

Loading

Uh oh!

vercel bot commented Feb 15, 2026

Uh oh!

greptile-apps bot left a comment

Uh oh!

Uh oh!

Uh oh!

Uh oh!

ArthurzKV commented Feb 15, 2026

Uh oh!

Uh oh!

steipete commented Mar 8, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

Uh oh!

Conversation

ArthurzKV commented Feb 15, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Summary

Why

Focused scope

What changed

Local validation

AI assistance transparency

Uh oh!

vercel bot commented Feb 15, 2026

Uh oh!

greptile-apps bot left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Uh oh!

Uh oh!

ArthurzKV commented Feb 15, 2026

Uh oh!

Uh oh!

steipete commented Mar 8, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

ArthurzKV commented Feb 15, 2026 •

edited

Loading